(1) GENE] 




>N: 



SEQUENCE LISTING 



(i) 
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(ii) 



TITLE OF INVENTION: A VACCINE - INDUCED HEPATITIS B VIRAL 
STRAIN AND USES THEREOF 



(iii) NUMBER OF SEQUENCES: 11 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Ladas & Parry 

(B) STREET: 26 West 61 Street 

(C) CITY: New York 

(D) STATE: New York 
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(F) ZIP: 10023 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
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(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 
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(B) FILING DATE:13-DEC-2000 

(C) CLASSIFICATION : 435 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/SG98/00045 
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(viii) ATTORNEY/ AGENT INFORMATION: 

(A) NAME: Mass, Clifford J. 

(B) REGISTRATION NUMBER: 3 0,086 

(C) REFERENCE/DOCKET NUMBER : U- 013 108 - 9 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: (212) 708-1890 

(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: circular 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

CTCCACCACT TTCCACCAAA CTCTTCAAGA TCCCAGAGTC AGGGCCCTGT ACTTTCCTGC 60 

TGGTGGCTCC AGTTCAGGAA CAGTGAGCCC TGCTCAGAAT ACTGTCTCTG CCATATCGTC 12 0 

AATCTTATCG AAGACTGGGG ACCCTGTACC GAACATGGAG AACATCG CAT CAGGACTCCT 18 0 

AGGACCCCTG CTCGTGTTAC AGGCGGGGTT TTTCTTGTTG ACAAAAATCC TCACAATACC 240 



GCAGAGTCTA GACTCGTGGT GGACTTCTCT CAATTTTCTA GGGGGAACAC CCGTGTGTCT 3 00 



TGGCCAAAAT TCGCAGTCCC AAATCTCCAG TCACTCACCA ACCTGTTGTC CTCCAATTTG 3 60 

TCCTGGTTAT CGCTGGATGT GTCTGCGGCG TTTTATCATC TTCCTCTGCA TCCTGCTGCT 42 0 

ATGCCTCATC TTCTTGTTGG TTCTTCTGGA CTATCAAGGT ATGTTGCCCG TTTGTCCTCT 480 

AATTCCAGGA TCAACAACAA CCAGCACCGG AC CAT G C AAA ACCTGCACAA CTCCTGCTCA 54 0 

AGGAACCTCT ATGTTTCCCT CATGTTGCTG TACAAAACCT ACGGACAGAA ACTGCACCTG 600 

TATTCCCATC CCATCATCTT GGGCTTTCGC AAAATAC CTA TGGGAGTGGG CCTCAGTCCG 66 0 

TTTCTCTTGG CTCAGTTTAC TAGTGCCATT TGTTCAGTGG TTCGTAGGGC TTTCCCCCAC 72 0 

TGTCTGGCTT TCAGTTATAT GGATGATGTG GTTTTGGGGG CCAAGTCTGT ACAACATCTT 780 

GAGTCCCTTT ATGCCGCTGT TACCAATTTT CTTTTGTCTT TGGGTATACA TTTAAACCCT 84 0 

CACAAAACAA AAAGATGGGG ATATTCCCTT AACTTCATGG GATATGTCAT TGGGAGTTGG 90 0 

GGCACATTGC CACAGGAACA TATTGTACAA AAAATCAAAA TGTGTTTTAG GAAACTTCCT 96 0 

GTAAACAGGC CTATTGATTG GAAAGTATGT CAACGAATTG TGGGTCTTTT GGGGTTTGCC 102 0 

GCCCCTTTCA CGCAATGTGG ATATCCTGCT TTAATGCCTT TATATGCATG TATACAAGCA 1080 

AAACAGGCTT TTACTTTCTC GCAAACTTAC AAGACCTTTC TAAGTAAACA GTATCTGAAC 114 0 

CTTTACCCCG TTGCTCGGCA ACGCCCTGGT CTGTGCCAAG TGTTTGCTGA CGCAACCCCC 12 0 0 

ACTGGTTGGG GCTTGGCCAT AGGCCATCAG CGCATGCGTG GAACCTTTGT GTCTCCTCTG 12 60 

CCGATCCATA CTGCGGAACT CCTAGCCGCT TGTTTTGCTC GCAGCAGGTC TGGGGCAAAA 13 2 0 

CTCATCGGGA CTGACAATTC TGTCGTGCTC TCCCGCAAGT ATACATCATT TCCATGGCTG 13 8 0 

CTAGGCTGTG CTGCCAACTG GATCCTGCGC GGGACGTCCT TTGTTTACGT CCCGTCGGCG 144 0 

CTGAATCCCG CGGACGACCC CTCCCGGGGC CGCTTGGGGC TCTACCGCCC GCTTCTCCGC 15 0 0 

CTGTTATACC GACCGACCAC GGGGCGCACC TCTCTTTACG CGGACTCCCC GTCTGTGCCT 15 60 

TCTCATCTGC CGGACCGTGT GCACTTCGCT TCACCTCTGC ACGTCGCATG GAGACCACCG 162 0 

TGAACGCCCA CGGGAACCTG CCCAAGGTCT TGCATAAGAG GACTCTTGGA CTTTCAGCAA 16 8 0 

TGTCAACGAC CGACCTTGAG GCATACTTCA AAG AC TGTGT GTTTAATGAG TGGGAGGAGT 174 0 

TGGGGGAGGA GGTTAGGTTA AAGGTCTTTG TACTAGGAGG CTGTAGGCAT AAATTGGTGT 18 0 0 

GTTCACCATC ACCATGCAAC TTTTTCACCT CTGCCTAATC ATCTCATGTT CATGTCCTAC 18 60 

TGTTCAAGCC TCCAAGCTGT GCCTTGGGTG GCTTTGGGGC ATGGACATTG ACCCGTATAA 192 0 

AGAATTTGGA GCTTCTGTGG AGTTACTCTC TTTTTTGCCT TCTGACTTTT TTCCTTCTAT 198 0 

TCGAGATCTC CTCGACACCG CCTCTGCTCT GT AT CGGG AG GCCTTAGAGT CTCCGGAACA 2 04 0 

TTGTTCACCT C AC C AT ACGG CACTCAGGCA AGCTATTCTG AGTTGGGGTG AGTTAATGAA 210 0 

TCTAGCCACC TGGGTGGGAA GTAATTTGGA AGATCCAGCA TCCAGGGAAT TAGTAGTCAG 216 0 

CTATGTCAAC GTTAATATGG GCCTAAAAAT CAGACAACTA TTGTGGTTTC ACATTTCCTG 22 2 0 

TCTTACTTTT GGGAGAGAAA CTGTTCTTGA ATATTTGGTG TCTTTTGGAG TGTGGATTCG 22 80 



CACTCCTCCT GCATATAGAC CACCAAATGC CCCTATCTTA TCAACACTTC CGGAAACTAC 2 34 0 

TGTTGTTAGA CGAAGAGGCA GGTCCCCTAG AAGAAGAACT CCCTCGCCTC GCAGACGAAG 24 0 0 

GTCTCAATCG CCGCGTCGCA GAAGATCTCA ATCTCGGGAA TCTCAATGTT AGTATTCCTT 246 0 

GGACACATAA GGTGGGAAAC TTTACGGGGC TTTATTCTTC TACGGTACCT TGCTTTAATC 2 52 0 

CTAAATGGCA AACTCCTTCT TTTCCGGACA TTCATTTGCA GGAGGACATT CTTGATAGAT 2 58 0 

GTAAGCAATT TGTGGGGCCC CTTACAGTAA ATGAAAACAG GAGACTAAAA TTAATTATGC 2 64 0 

CTGCTAGGTT TTATCCAAAT GTTACTAAAT ATTTGCCCTT AGATAAAGGG ATCAAACCAT 270 0 

ATTATCCAGA GTATGTAGTT AATCATTACT TCCAGACGCG ACATTATTTA CACACTCTTT 276 0 

GGAAGGCGGG GATCTTATAT AAAAGAGAGT C C AC ACGT AG CGCCTCATTT TGCGGGTCAC 2 82 0 

CATATTCTTG GGAACAAGAT CTACAGCATG GGAGGTTGGT CTTCCAAACC TCGAAAAGGC 2 8 80 

ATGGGGACAA ATCTTTCTGT CCCCAATCCC CTGGGATTCT TCCCCGATCA TCAGTTGGAC 2 94 0 

CCTGCATTCA AAGC CAACTC AGAAAATCCA GATTGGGACC TCAACCCGCA CAAGGACAAC 3 0 00 

TGGCCGGACG CCAACAAGGT GGGAGTGGGA GCATTCGGGC CAGGGTTCAC CCCTCCTCAT 3 060 

GGGGGACTGT TGGGGTGGAG CCCTCAGGCT CAGGGCCTAC TCACAACTGT GCCAGCAGCT 312 0 

CCTCCTCCTG CCTCCACCAA TCGGCAGTCA GGAAGGCAGC CTACTCCCTT ATCTCCACCT 318 0 

CTAAGGGACA CTCATCCTCA GGCCATGCAG TGGAA 3 215 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Pro Leu Ser Tyr Gin His Phe Arg Lys Leu Leu Leu Leu Asp Glu 
15 10 15 

Glu Ala Gly Pro Leu Glu Glu Glu Leu Pro Arg Leu Ala Asp Glu Gly 
20 25 30 

Leu Asn Arg Arg Val Ala Glu Asp Leu Asn Leu Gly Asn Leu Asn Val 
35 40 45 

Ser lie Pro Trp Thr His Lys Val Gly Asn Phe Thr Gly Leu Tyr Ser 
50 55 60 

Ser Thr Val Pro Cys Phe Asn Pro Lys Trp Gin Thr Pro Ser Phe Pro 
65 70 75 80 

Asp lie His Leu Gin Glu Asp lie Leu Asp Arg Cys Lys Gin Phe Val 
85 90 95 

Glu Pro Leu Thr Val Asn Glu Asn Arg Arg Leu Lys Leu lie Met Pro 
100 105 110 



Ala Arg Phe Tyr Pro Asn Val Thr Lys Tyr Leu Pro Leu Asp Lys Gly 
115 120 125 

lie Lys Pro Tyr Tyr Pro Glu Tyr Val Val Asn His Tyr Phe Gin Thr 
130 135 140 

Arg His Tyr Leu His Thr Leu Trp Lys Ala Gly lie Leu Tyr Lys Arg 
145 150 155 160 

Glu Ser Thr Arg Ser Ala Ser Phe Cys Gly Ser Pro Tyr Ser Trp Glu 
165 170 175 

Gin Asp Leu Gin His Gly Arg Leu Val Phe Gin Thr Ser Lys Arg His 
180 185 190 

Gly Asp Lys Ser Phe Cys Pro Glu Ser Pro Gly lie Leu Pro Arg Ser 
195 200 205 

Ser Val Gly Pro Cys lie Gin Ser Gin Leu Arg Lys Ser Arg Leu Gly 
210 215 220 

Pro Gin Pro Ala Gin Gly Gin Leu Ala Gly Arg Gin Gin Gly Gly Ser 
225 230 235 240 

Gly Ser lie Arg Ala Arg Val His Pro Ser Ser Trp Gly Thr Val Gly 
245 250 255 

Val Glu Pro Ser Gly Ser Gly Pro Thr His Asn Cys Ala Ser Ser Ser 
260 265 270 

Ser Ser Cys Leu His Gin Ser Ala Val Arg Lys Ala Ala Tyr Ser Leu 
275 280 285 

lie Ser Thr Ser Lys Gly His Ser Ser Ser Gly His Ala Val Glu Leu 
290 295 300 

His His Phe Pro Pro Asn Ser Ser Arg Ser Gin Ser Gin Gly Pro Val 
305 310 315 320 

Leu Ser Cys Trp Trp Leu Gin Phe Arg Asn Ser Glu Pro Cys Ser Glu 
325 330 335 

Tyr Cys Leu Cys His lie Val Asn Leu lie Glu Asp Trp Gly Pro Cys 
340 345 350 

Thr Glu His Gly Glu His Arg lie Arg Thr Pro Arg Thr Pro Ala Arg 
355 360 365 

Val Thr Gly Gly Val Phe Leu Val Asp Lys Asn Pro His Asn Thr Ala 
370 375 380 

Glu Ser Arg Leu Val Val Asp Phe Ser Gin Phe Ser Arg Gly Asn Thr 
385 390 395 400 

Arg Val Ser Trp Pro Lys Phe Ala Val Pro Asn Leu Gin Ser Leu Thr 
405 410 415 

Asn Leu Leu Ser Ser Asn Leu Ser Trp Leu Ser Leu Asp Val Ser Ala 
420 425 430 

Ala Phe Tyr His Leu Pro Leu His Pro Ala Ala Met Pro His Leu Leu 
435 440 445 

Val Gly Ser Ser Gly Leu Ser Arg Tyr Val Ala Arg Leu Ser Ser Asn 
450 455 460 



Ser Arg lie Asn 
465 

Ser Cys Ser Arg 



Tyr Gly Gin Lys 
500 

Arg Lys lie Pro 
515 

Phe Thr Ser Ala 
530 

Leu Ala Phe Ser 
545 

Gin His Leu Glu 



Leu Gly lie His 
580 

Leu Asn Phe Met 
595 



Asn Asn Glu His 
470 

Asn Leu Tyr Val 
485 

Leu His Leu Tyr 



Met Gly Val Gly 
520 

lie Cys Ser Val 
535 

Tyr Met Asp Asp 
550 

Ser Leu Tyr Ala 
565 

Leu Asn Pro His 



Gly Tyr Val lie 
600 



Arg Thr Met Glu 
475 

Ser Leu Met Leu 
490 

Ser His Pro lie 
505 

Leu Ser Pro Phe 



Val Arg Arg Ala 
540 

Val Val Leu Gly 
555 

Ala Val Thr Asn 
570 

Lys Thr Lys Arg 
585 

Gly Ser Trp Gly 



Asn Leu His Asn 
480 

Leu Tyr Lys Thr 
495 

lie Leu Gly Phe 
510 

Leu Leu Ala Gin 
525 

Phe Pro His Cys 



Ala Lys Ser Val 
560 

Phe Leu Leu Ser 
575 

Trp Gly Tyr Ser 
590 

Thr Leu Pro Gin 
605 



Glu His lie Val Gin Lys lie Lys Met Cys Phe Arg Lys Leu Pro Val 
610 615 620 

Asn Arg Pro lie Asp Trp Lys Val Cys Gin Arg lie Val Gly Leu Leu 
625 630 635 640 

Gly Phe Ala Ala Pro Phe Thr Gin Cys Gly Tyr Pro Ala Leu Met Pro 
645 650 655 

Leu Tyr Ala Cys lie Gin Ala Lys Gin Ala Phe Thr Phe Ser Gin Thr 
660 665 670 

Tyr Lys Thr Phe Leu Ser Lys Gin Tyr Leu Asn Leu Tyr Pro Val Ala 
675 680 685 

Arg Gin Arg Pro Gly Leu Cys Glu Val Phe Ala Asp Ala Thr Pro Thr 
690 695 700 

Gly Trp Gly Leu Ala lie Gly His Gin Arg Met Arg Gly Thr Phe Val 
705 710 715 720 

Ser Pro Leu Pro lie His Thr Ala Glu Leu Leu Ala Ala Cys Phe Ala 
725 730 735 

Arg Ser Arg Ser Gly Ala Lys Leu lie Gly Thr Asp Asn Ser Val Val 
740 745 750 

Leu Ser Arg Lys Tyr Thr Ser Phe Pro Trp Leu Leu Gly Cys Ala Ala 
755 760 765 

Asn Trp lie Leu Arg Gly Thr Ser Phe Val Tyr Val Pro Ser Ala Leu 
770 775 780 

Asn Pro Ala Asp Asp Pro Ser Arg Gly Arg Leu Gly Leu Tyr Arg Pro 
785 790 795 800 



Leu Leu Arg Leu Leu Tyr Arg Pro Thr Thr Gly Arg Thr Ser Leu Tyr 



805 



810 



815 



Ala Asp Ser Pro Ser Val Pro Ser His Leu Pro Asp Arg Val His Phe 
820 825 830 

Ala Ser Pro Leu His Val Ala Trp Arg Pro Pro 
835 840 



INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DEi 

Met Gly Gly Trp 
1 

Ser Val Pro Asn 
20 

Ala Phe Lys Ala 
35 

Lys Asp Asn Trp 
50 

Pro Gly Phe Thr 
65 

Ala Gin Gly Leu 



Thr Asn Arg Gin 
100 

Arg Asp Thr His 
115 



CRIPTION: SEQ i: 

Ser Ser Lys Pro 
5 

Pro Leu Gly Phe 



Asn Ser Glu Asn 
40 

Pro Asp Ala Asn 
55 

Pro Pro His Gly 
70 

Leu Thr Thr Val 
85 

Ser Gly Arg Gin 



Pro Gin Ala Met 
120 



) NO : 3 : 

Arg Lys Gly Met 
10 

Phe Pro Asp His 
25 

Pro Asp Trp Asp 



Lys Val Gly Val 
60 

Gly Leu Leu Gly 
75 

Pro Ala Ala Pro 
90 

Pro Thr Pro Leu 
105 

Gin Trp Asn Ser 



Gly Thr Asn Leu 
15 

Gin Leu Asp Pro 
30 

Leu Asn Pro His 
45 

Gly Ala Phe Gly 



Trp Ser Pro Gin 
80 

Pro Pro Ala Ser 
95 

Ser Pro Pro Leu 
110 

Thr Thr Phe His 
125 



Gin Thr Leu Gin Asp Pro 
130 

Gly Ser Ser Ser Gly Thr 
145 150 

lie Ser Ser lie Leu Ser 
165 

Asn lie Ala Ser Gly Leu 
180 

Phe Phe Leu Leu Thr Lys 
195 

Trp Trp Thr Ser Leu Asn 
210 



Arg Val Arg Ala Leu 
135 

Val Ser Pro Ala Gin 
155 

Lys Thr Gly Asp Pro 
170 

Leu Gly Pro Leu Leu 
185 

lie Leu Thr lie Pro 
200 

Phe Leu Gly Gly Thr 
215 



Tyr Phe Pro Ala Gly 
140 

Asn Thr Val Ser Ala 
160 

Val Pro Asn Met Glu 
175 

Val Leu Gin Ala Gly 
190 

Gin Ser Leu Asp Ser 
205 

Pro Val Cys Leu Gly 
220 



Gin Asn Ser Gin 
225 

Pro lie Cys Pro 



Phe Leu Cys lie 
260 



Ser Gin lie Ser 
230 

Gly Tyr Arg Trp 
245 

Leu Leu Leu Cys 



Ser His Ser Pro 
235 

Met Cys Leu Arg 
250 

Leu lie Phe Leu 
265 



Thr Cys Cys Pro 
240 

Arg Phe lie lie 
255 

Leu Val Leu Leu 
270 



Asp Tyr Gin Gly 
275 

Thr Thr Ser Thr 
290 

Thr Ser Met Phe 
305 

Cys Thr Cys lie 



Trp Glu Trp Ala 
340 

Phe Val Gin Trp 
355 

lie Trp Met Met 
370 

Pro Phe Met Pro 
385 



Met Leu Pro Val 
280 

Gly Pro Cys Lys 
295 

Pro Ser Cys Cys 
310 

Pro lie Pro Ser 
325 

Ser Val Arg Phe 



Phe Val Gly Leu 
360 

Trp Phe Trp Gly 
375 

Leu Leu Pro lie 
390 



Cys Pro Leu lie 



Thr Cys Thr Thr 
300 

Cys Thr Lys Pro 
315 

Ser Trp Ala Phe 
330 

Ser Trp Leu Ser 
345 

Ser Pro Thr Val 



Pro Ser Leu Tyr 
380 

Phe Phe Cys Leu 
395 



Pro Gly Ser Thr 
285 

Pro Ala Gin Gly 



Thr Asp Arg Asn 
320 

Ala Lys Tyr Leu 
335 

Leu Leu Val Pro 
350 

Trp Leu Ser Val 
365 

Asn lie Leu Ser 



Trp Val Tyr lie 
400 



INFORMATION FOR SEQ ID NO : 4 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(xi) SEQUENCE DE, 

Met Gin Leu Phe 
1 

Val Gin Ala Ser 
20 

Asp Pro Tyr Lys 
35 

Pro Ser Asp Phe 
50 

Ala Leu Tyr Arg 
65 

His Thr Ala Leu 



ICRIPTION: SEQ I] 

His Leu Cys Leu 
5 

Lys Leu Cys Leu 



Glu Phe Gly Ala 
40 

Phe Pro Ser lie 
55 

Glu Ala Leu Glu 
70 

Arg Gin Ala lie 
85 



► NO : 4 : 

lie lie Ser Cys 
10 

Gly Trp Leu Trp 
25 

Ser Val Glu Leu 



Arg Asp Leu Leu 
60 

Ser Pro Glu His 
75 

Leu Ser Trp Gly 
90 



Ser Cys Pro Thr 
15 

Gly Met Asp lie 
30 

Leu Ser Phe Leu 
45 

Asp Thr Ala Ser 



Cys Ser Pro His 
80 

Glu Leu Met Asn 
95 



Leu Ala Thr Trp Val Gly Ser Asn Leu Glu Asp Pro Ala Ser Arg Glu 



100 



105 



110 



Leu Val Val Ser 
115 

Leu Leu Trp Phe 
130 

Leu Glu Tyr Leu 
145 

Tyr Arg Pro Pro 



Val Val Arg Arg 
180 

Arg Arg Arg Arg 
195 

Glu Ser Gin Cys 
210 



Tyr Val Asn Val 
120 

His lie Ser Cys 
135 

Val Ser Phe Gly 
150 

Asn Ala Pro lie 
165 

Arg Gly Arg Ser 



Ser Gin Ser Pro 
200 



Asn Met Gly Leu 



Leu Thr Phe Gly 
140 

Val Trp lie Arg 
155 

Leu Ser Thr Leu 
170 

Pro Arg Arg Arg 
185 

Arg Arg Arg Arg 



Lys lie Arg Gin 
125 

Arg Glu Thr Val 



Thr Pro Pro Ala 
160 

Pro Glu Thr Thr 
175 

Thr Pro Ser Pro 
190 

Ser Gin Ser Arg 
205 



(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Ala Arg Leu Cys Cys Gin Leu Asp Pro Ala Arg Asp Val Leu 
15 10 15 

Cys Leu Arg Pro Val Gly Ala Glu Ser Arg Gly Arg Pro Leu Pro Gly 
20 25 30 

Pro Leu Gly Ala Leu Pro Pro Ala Ser Pro Pro Val lie Pro Thr Asp 
35 40 45 

His Gly Ala His Leu Ser Leu Arg Gly Leu Pro Val Cys Ala Phe Ser 
50 55 60 

Ser Ala Gly Pro Cys Ala Leu Arg Phe Thr Ser Ala Arg Arg Met Glu 
65 70 75 80 

Thr Thr Val Asn Ala His Gly Asn Leu Pro Lys Val Leu His Lys Arg 
85 90 95 

Thr Leu Gly Leu Ser Ala Met Ser Thr Thr Asp Leu Glu Ala Tyr Phe 
100 105 110 

Lys Asp Cys Val Phe Asn Glu Trp Glu Glu Leu Gly Glu Glu Val Arg 
115 120 125 

Leu Lys Val Phe Val Leu Gly Gly Cys Arg His Lys Leu Val Cys Ser 
130 135 140 



Pro Ser Pro Cys Asn Phe Phe Thr Ser Ala 
145 150 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 
ATAAGCTTATG CCCCTATCTT ATCAACACTT CCGGA 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 
GAGTCTAGAC TCTGCGGTAT TGTGA 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 
GAGTCTAGAC TCGTGGTGGA CTTCT 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 
TGAGAATTCT CACGGTGGTC TCCATGCGAC GT 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 



TTTGTTTACG TCCCGT 



(2) INFORMATION FOR SEQ ID NO: 11: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CTAAGCTTAG TTTCCGGAAG TGTTGAT 



